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As the global HIV pandemic enters its fourth decade, increas¬ 
ing numbers of surveillance sites have been established which allows 
countries to look into the epidemics at a finer scale, e.g. at sub¬ 
national levels. Currently, the epidemic models have been applied 
independently to the sub-national areas within countries. However, 
the availability and quality of the data vary widely, which leads to 
biased and unreliable estimates for areas with very few data. We 
propose to overcome this issue by introducing the dependence of the 
parameters across areas in a mixture model. The joint distribution of 
the parameters in multiple areas can be approximated directly from 
the results of independent fits without needing to refit the data or 
unpack the software. As a result, the mixture model has better pre¬ 
dictive ability than the independent model as shown in examples of 
multiple countries in Sub-Saharan Africa. 


1. Introduction. Since the first case reported in 1981 (Centers for Dis¬ 
ease Control 1981), the global HIV epidemic has become one of the greatest 
threats to human health and development. The number of people living with 
HIV worldwide continued to grow, reaching 35 million in 2013, about three 
times more than in 1990. There have been 78 million infected with HIV and 
over 29 million AIDS-related deaths so far. The response to the HIV epi¬ 
demic has been mixed, with progress being made to reduce new infection and 
rapid improvements in survival rates among AIDS patients as antiretroviral 
therapy has become available in recent years (UNAIDS 2014). 

Countries need to ground their HIV strategies in an understanding of 
their own epidemics and their national responses. Reliable estimation and 
prediction of the HIV epidemic can help policy makers and program plan¬ 
ners efficiently allocate their resources, plan and manage the intervention, 
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treatment and care programs, evaluate their effort, and raise funds. There¬ 
fore, accurate estimation and projection of the epidemic is the foundation 
of all HIV-related studies (UNAIDS/WHO Working Group on Global HIV 
and STI Surveillance 2013). In addition, countries - especially large coun¬ 
tries and countries with geographically heterogeneous epidemics - need to 
understand the variation of the impact within the country by geographic 
area and population (Mahy et al. 2014). Having surveillance data and HIV 
estimates available by sub-national entities helps allocating limited resources 
(UNAIDS 2013). 

Most countries (158 countries submitted files in 2014) use Spectrum soft¬ 
ware to estimate the impact of HIV on their population at the national level 
(Case et al. 2014). Spectrum relies on data from HIV surveillance systems 
and the Estimation and Projection Package (EPP) to estimate trends in HIV 
incidence over time (Putures Institute 2014; Brown et al. 2014; Stover et al. 
2014). The UN AIDS Reference Group on Estimates, Modelling and Projec¬ 
tions has developed, and continues to refine Spectrum/EPP for estimation 
and short-term prediction of HIV trends (Ghys et al. 2004; Brown et al. 
2008; Brown et al. 2010; Brown et al. 2014). Due to the paucity of reliable 
information on the incidence of HIV in most countries, sentinel surveillance 
systems for HIV were designed to monitor the HIV prevalence trends. Using 
these surveillance data from 1970 through the current year, EPP recon¬ 
structs trends in HIV prevalence, incidence and mortality and makes short 
term projections. 

As the global HIV pandemic enters its fourth decade, increasing numbers 
of surveillance sites have been established which allows countries to look 
into the epidemics at a hner scale, e.g. at sub-national level. Most countries 
with generalized epidemics (epidemics in which HIV transmission is primar¬ 
ily in the general population through heterosexual sex) have historical HIV 
surveillance data from women attending antenatal clinics from year 2000 to 
present. In addition, all but four generalized epidemic countries have house¬ 
hold surveys that measure the HIV prevalence in the national population 
as well as sub-national estimates to the first sub-national administrative 
level (Marsh et al. 2014). Gurrently, the estimation and projection process 
has been implemented independently for areas in those countries. However, 
the availability and quality of HIV surveillance data used for these models 
is variable (Calleja et al. 2010). Some key components of data availability 
and quality include the number of years of data to show trends over time, 
the representativeness of the data across the country, and the accuracy of 
those data (Lyerla et al. 2008). When the sub-national areas do not have 
sufficient high quality data the models produce inaccurate results with large 
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uncertainty bounds. 

One way to improve accuracy of the estimates is to borrow information 
from neighboring areas in the case of sub-national estimates. For example 
many areas have few data points early in the HIV epidemic (between 1980 
and 2000). In such situation, if the data from neighboring areas are assumed 
to have similar trends, the trends from those areas can be used to inform the 
one in the lack-of-data area. In this paper we introduce an informative prior 
distribution derived from the mixture model to allow sharing information 
across datasets. In the mixture model, the epidemiologically implausible 
trajectories fit in the area with few data would be down-weighted by the 
area with rich data. The difficulty of utilizing the proposed model is that 
the underlying epidemic model is described by differential equations, and 
the computational cost of fitting such a model is high because the model 
parameters do not have analytic solutions. In this article, we propose an 
importance sampling method that draws posterior samples of parameters in 
the new model without needing to refit the data. 

In Section 2 we describe the EPP model used by UNAIDS, the mixture 
model, the parameter estimation, and the evaluation procedure in detail. 
In Section 3, we give results for 20 countries in Sub-Saharan Africa, and in 
Section 4 we offer conclusions and discussion for future work. 

2. Approach. In this section, we first describe a dynamic model used 
by EPP for estimating and projecting the HIV epidemic from prevalence 
data. We then introduce the mixture model that allows information to be 
shared across areas with similar epidemiological patterns. After that we 
propose a novel parameter estimation method for the mixture model that 
avoids additional runs of the dynamic model. Finally we discuss the model 
evaluation procedure. 

2.1. The EPP Model. The Estimation and Projection Package (EPP) 
is based on a simple susceptible-infected (SI) epidemiological model. The 
dynamic model is as follows : 

( 1 ) 

= E{t) - r{t)p{t)Z{t) - - a 5 o{t)Z{t) + M{t)Z{t), 

= r{t)p{t)Z{t) — HlVdeath(t) — a 5 o{t)Y{t) + M{t)Y{t), 

The population being modeled is aged between 15 and 49, and the popula¬ 
tion at time t is divided into two groups; Z{t) is the number of uninfected 
individuals, Y{t) is the number of infected individuals, N{t) = Z{t) + Y{t) 
is the total adult population size, E(t) is the number of new adults entering 
the population, p{t) = Y{t)/N{t) is the prevalence rate, and HlVdeath(t) is 
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the number of deaths among infected individuals. There is a set of param¬ 
eters defined by external life-tables such as /i(t), the non-AIDS mortality 
rate; 050 (t), the rate at which adults exit the model after attaining age 50, 
and M{t), the rate of net migration into the population. 

The infection rate, r{t), is the expected number of persons infected by 
one HIV positive person in year t in a wholly susceptible population Z{t), 
and is assumed to have the following form: 

(2) logr(t -hi) - logr(t) = A x (^0 - r{t)) - ( 32 p{t) + ^37(0, 

where 'jt = iniphes the tendency of stabilization after year 

ti- Therefore, Equation (2) imposes some common structure on the changes 
of r{t) across countries: r{t) declines if r(t) > /3o, or p{t) is too high, or the 
relative increase of prevalence is too large. Furthermore, we define to as the 
start year of the epidemic at which the prevalence is 0.0025%, and let tq be 
the infection rate at to- 

The data for EPP model estimation mainly come from ANC data which 
consist of the number of infected women Yu and the number of women tested 
Nu, for a given year t and a given clinic i. Let Wu = where 

is the inverse cumulative distribution function of the standard normal 
distribution. The data and the population prevalence - p{t) - derived from 
the dynamic system (1) are linked through a random effect model: 

Wit = ‘h ^(pt) + 1^4: Y bi + eu, 

(3) bi ~ N{0,a^), 
eu ~ N{0, fit) 

where /34 is the bias of ANC data with respect to prevalence data from 
national population-based household surveys (NPBS) (Bao 2012), and bi is 
the clinic random effect, cr^ is assumed to have an inverse-Gamma prior 
which gets integrated out in the likelihood evaluation, vu is a fixed quantity 
that depends on the clinic data and approximates the binomial variation 
(Alkema, Raftery, and Clark 2007). 

With each input set, 6 = {to,tiYo, Po, f3i, ^ 2 , ^ 3 , from Equation (1) 
and (2), there is a series of output p{t), which get evaluated in the likelihood 
based on Equation (3). With a set of informative priors incorporating em¬ 
pirical evidence and expert opinions as in Equations (4), Bayesian inference 
of the population prevalence, incidence and mortality (all derived from the 
dynamic system) can be performed for the EPP model (Bao 2012). To be 
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specific, the following informative prior distributions are used: 

to ~ Uniform[1970,1990], 
h ~ r(2) (20,0.458), 

logro ~ r(' 2 ')(0.42,0.081), 

/3o ~ r(2)(0.46, 0.192), 

^ ^ /3i ~ r(2)(0.17,0.091), 

h ~ r(2) (-0.68,0.264), 

~ r(2)(-0.038, 0.009), 

/34 ~ r(2)(0.14, 0.068). 

where T( 2 )(i^O)To) is a t-distribution with location parameter /tq, scale pa¬ 
rameter (To, and 2 degrees of freedom. The T^ 2 ) distribution has infinity 
variance which ensures a good coverage of all possible parameter values. For 
simplicity, (toTi; ^O) Poi I3i,l32, Ps, Pi) are assumed to be independent in the 
prior distribution. The dynamic model runs at a 0.1-year time step, so to and 
ti will be rounded into one decimal place when calculating the likelihood. 


2.2. The Mixture Model. In the current EPP implementation, all areas 
apply the same prior in Equation (4) independently. We refer to the current 
model using prior (4) as the independent model. The availability and quality 
of the data vary from area to area. In areas with sufficient data, the inde¬ 
pendent model can work reasonably well, while in data scarce areas it would 
result in unreliable estimates. One way to overcome this issue in the data 
scarce areas is to borrow information from other areas with similar epidemic 
trends, such as areas within the same country. A hierarchical structure can 
be useful in associating parameters among areas, as illustrated in Figure 1. 
We see that the sub-national parameter estimates affect each other through 
the national parameter estimates. 



Fig 1: A hierarchical structure for input parameters of EPP. 


For each country, let Oi = (tjo,iii,Tio, Ao, Ai,/3i2, As, A 4 ) be the param- 
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eters of area i, so 6ij is the jth parameter in the ith area, with j = 1, 2,8 
and i = 1,2,..., K. To implement a structure as in Figure 1, we can assume 
the following model: 


9ij\fij,a] ~ N[^j, (1 - poj)a]], 

(5) pj\a] ~ N[poj,pojCr]], 

(t| ~ InverseGamma [aoj, boj ], 

where poj, poj, uqj and 6oj are hyperparameters. Integrating out pj and cr|, 
we get the joint distribution of {9ij, 92j, ■ ■ ■, ^Kj), 
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which is a multivariate t-distribution with = boj/aoj. Setting a^j = 1, 
poj and aoj to be the means and standard deviations in Equation (4), we 
have the same marginal distributions between the hierarchical model (5) and 
the independent model (4) except for to. to follows a uniform distribution 
ranging from 1970 to 1990 in the independent model, and a truncated t- 
distribution also ranging from 1970 and 1990 in the hierarchical model, 
and those two distributions have the same mean and variance. As a result, 
the hierarchical model and the independent model mainly differ in whether 
allowing correlations between parameters from different areas. 

As mentioned above, the independent prior (4) works reasonably well in 
areas with sufficient data. Moreover, even within the same country, different 
areas could have heterogeneous epidemics so that the parameters are not 
obviously correlated. While on the other hand, for some other areas with 
fewer data, the correlated structure (6) is critical in achieving a reliable 
estimate. In order for the priors to be flexible enough to accommodate both 
situations, we propose the following mixture model: let foj be the prior 
density of the independent model (4) and fij be the prior density of the 
hierarchical model (6) when poj ^ 0, and define the mixture model pj as the 
following: 

(7) gj{6j) = T^ojkj{Oj) + (1 - 'noj)h{Oj\poj,<yoj^Poj), 

where iroj is the mixing probability or the weight of the independent model 
that we will determine together with poj. The resulting mixture model (7) 
accounts for the model uncertainty. 
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2.3. Hyperparameters. We first need to determine a reasonable set of the 
hyperparameters po and ttq. Here we adopt the empirical Bayes framework 
by using the point estimates from countries with multiple datasets. Let Oicj 
be the posterior mean of the jth parameter obtained from fitting the inde¬ 
pendent model to the fth area of a high-data-quality country c. vroj and poj 
could be estimated by maximizing the likelihood with known fiQj and (Tq^-: 

(8) n5i(^ci)=n[ ^Oj foi^cj) + (1 IPOjj Tqj , poj) j 

c c 

Currently, there are not many countries with high-quality datasets at 
sub-national level. Fosdick and Raftery (2012) compared approaches of es¬ 
timating the correlation in bivariate normal data with known variances and 
small sample sizes, and found that the Bayesian approach consistently out¬ 
performed several exact and approximate maximum likelihood estimators 
for small samples. Therefore, we assume a prior for pQj and vroj and set the 
posterior mode as the hyperparameter value in Equation (7). Specifically, we 
assume that ^ Beta(1.5,1.5) which is slightly concentrated towards 

poj = 0 with density 1.273 at = 9 5 ^ while the uniform distribution has 
density 1 over the domain of [0,1]. We also assume that ttoj ~ Beta(l, 1.5) 
which has mean 0.6 and density 1.5 at 1 so that it is slightly in favor of larger 
values of ttoj. Finally, we denote g = WjQj as the joint prior distribution 
of the mixture model and /o = Hj foj as the joint prior distribution of the 
independent model. 

2.4. Parameter Estimation via Importance Sampling. Estimating param¬ 
eters in dynamic models is often time consuming due to the lack of an ana¬ 
lytic solution, the multi-modality and the nonlinearity. Fitting the datasets 
from multiple areas simultaneously in the mixture model (7) will further 
increase the computing cost. It is desirable to develop a procedure that 
produces the mixture model results without additional computational cost. 
Since running the dynamic model is the most time consuming part of the 
estimation, we propose applying the independent model to fit each individ¬ 
ual dataset as currently being implemented in EPP/Spectrum 2013 (Brown 
et al. 2014) and using importance sampling to approximate the joint poste¬ 
rior distribution of parameters in the mixture model for multiple datasets. 

Applying the independent model to each individual dataset, the posterior 
samples are drawn by the incremental mixture importance sampling (IMIS) 
algorithm which has been proved to be efficient for estimating parameters 
in dynamic systems with a moderate number of parameters (Raftery and 
Bao 2010; Brown et al. 2010). It works as follows: 
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1. Draw initial samples from a sampling distribution, e.g. the prior dis¬ 
tribution, /o; 

2. Update the importance weight for each sample, which is the ratio 
between posterior density, P(0i|Datai), and density of the sampling 
distribution; 

3. Find the sample with the highest weights, and draw new ones from a 
multivariate Gaussian distribution centered around it; 

4. Combine new samples with all existing samples, update the sampling 
distribution by a mixture of initial sampling distribution and multiple 
Gaussian components, and recalculate the importance weight; 

5. Iterate between step 2 and 4 until there is no large importance weight; 

6 . Resample all samples from the multinomial distribution with weights. 

The initial sampling distribution is often flat so that it ensures a good cov¬ 
erage of the entire parameter space. As a new Gaussian sampler is placed 
in the region that is under-represented by the sampling distribution at each 
iteration, the sampling distribution gets closer and closer to the posterior 
distribution. 

To avoid a separate run of the proposed model, the step 6 of the IMIS 
algorithm can be revised as follows: 

6 .a. Repeat steps 1 ~ 5 for areas i = 1,... ,K. All samples with non-ignoble 
importance weight will be stored, e.g. greater than 10“®,; 

6 .b. Create a joint sample of parameters across all areas by randomly taking 
one sample from each area, and repeat until we have a large number 
of candidate joint samples, e.g. 1,000,000; 

6.C. The importance weight of a joint sample (0i,..., Ok) is calculated as 
the product of the importance weights for 0j’s times ? the 

ratio between the mixture model prior density and the independent 
model prior density; 

6 .d. Resample all samples from the multinomial distribution with weights. 

The resulting posterior samples will have replicates because they are 
drawn from multinomial distribution with replacement, and a large num¬ 
ber of replicates suggests a low sampling efficiency - a large discrepancy 
between the sampling distribution and the target distribution (Raftery and 
Bao 2010). The ratio between two prior densities is 



which has the lower bound Hj '^Oj y so that all combinations have non-ignoble 
reweighing factor. Therefore, limiting the importance weights helps obtain 
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desired sampling efficiency. In the result section we will show that the revised 
IMIS algorithm still provides a large number of unique samples. 

2.5. Assessing Model Fit. We compare performance of the original EPP 
model (independent model) and the proposed mixture model based on datasets 
collected from the following 20 countries in Sub-Saharan Africa: Benin, 
Botswana, Burkina Faso, Burundi, Cameroun, Chad, Ethiopia, Ghana, Kenya, 
Lesotho, Malawi, Mali, Nigeria, Republic of Cote d’Ivoire (RCI), Democratic 
Republic of the Congo (RDC), Rwanda, SierraLeone, Tanzania, Uganda, and 
Zambia. We construct two scenarios to assess the model fit by splitting the 
data into training and test in two different ways. In the first scenario we 
assess the prediction into the future by withholding the last 3 years of data 
as the test set. In the second scenario we assess both the future prediction 
and recovery of the past by using only the mid-period data as the training 
set and predict both the earlier and later data. 

3. Results. To determine the values of ttq and po, we obtain a set 
of posterior means by applying the independent model to 14 countries with 
multiple high-quality datasets, where high-quality is defined as having preva¬ 
lence data at multiple clinics for at least 7 years and at least one national 
population based survey. Here we focus on high-quality datasets because 
their point estimates are mainly driven by the data instead of the informa¬ 
tive prior distribution in Equation (4). Table 1 presents the estimated ttq 
and Po following the procedure in Section 2.3. It suggests that the correla¬ 
tions of 132 s between areas are negligible with probability vroe = 1 of using 
the independent model prior distribution. 


Table 1 

TTo and Po determined by point estimates of high-quality datasets. 



01 

02 

03 

04 

05 

06 

67 

08 


io 

ti 

log ro 

Po 

Pi 

P2 


Pi 

TTOj 

0.278 

0.249 

0.315 

0.154 

0.490 

1.000 

0.196 

0.125 

POj 

0.822 

0.925 

0.996 

0.708 

0.783 

0.000 

0.529 

0.608 


We then apply both the independent model and the mixture model to 
datasets in 20 countries that have sufficient data to evaluate the prediction 
performance by with holding the last 3 years as test datasets. Table 2 sum¬ 
marizes prediction performance in term of the mean absolute errors (MAE) 
of antenatal clinic prevalence, and the sampling efficiency in term of the 
number of unique posterior samples. 

Despite the success of the independent model with the informative prior 
distribution in many high-quality-data countries (Bao 2012), the mixture 
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model further improves 3-year prediction MAE by 0.05 on average, with (5th, 
95th) percentiles being (-0.03, 0.22). Among 41 datasets, the improvement is 
greater than 0.05 for 13 datasets, and less than -0.05 for one dataset. We also 
note that the mixture model provides better prediction accuracy especially 
for areas that have fewer data. Table 2 highlights eight areas with less than 
5 years of training data - four of them gain substantial improvements by 
using the mixture model, and the remaining four areas have little difference. 

The top four panels in Figure 2 present examples that the mixture model 
results differ systematically from the independent model results during the 
3-year period of test data. In the Sierra Leone example, the mixture model 
(blue) improves the prediction accuracy as its posterior mean is closer to the 
average of observed HIV prevalence (red) than the posterior mean of inde¬ 
pendent model (black). Moreover, Sierra Leone surveillance data starts from 
2003 in urban areas and 2006 in rural areas. Two models provided signifi¬ 
cant different estimates for the early period of Sierra Leone rural epidemic. 
It was because Sierra Leone urban data suggested the epidemic started af¬ 
ter 1980 and the rural area estimation borrowed that information in the 
mixture model. In the Burundi example, the mixture model (blue) fails to 
make a better prediction. It was because the prevalence had an unexpected 
decline in the test data period (after the red vertical line), and neither urban 
training data nor rural training data could predict that decline. It revealed 
a limitation of the mixture model: if there is a sudden change of the epi¬ 
demic pattern that could not predicted by any individual dataset, borrowing 
information from each other could not help in capturing the change. 

IMIS method (Raftery and Bao 2010) is used to draw the posterior sam¬ 
ples for the independent model. Among all datasets, it provides 801 unique 
samples out of 1,000 resamples on average, with (5th, 95th) percentiles being 
(767, 828). Adjusting the importance weights following procedures 6.a.~6.c. 
in section 2.4., the average number of unique samples for the mixture model 
is 537, and (5th, 95th) percentiles are (235, 762). It is less efficient than 
applying IMIS from the scratch in term of the number of unique samples. 
However, we avoid re-running the time-consuming EPP models and get new 
results within 10 seconds for each country. The gray curves in Figure 2 show 
the posterior samples of mixture model. It seems that simply adjusting the 
importance weights provides enough unique samples to cover the high pos¬ 
terior density region. 

In the next few years, many countries are going to produce estimates at 
finer sub-national scales, e.g. at province or district levels. The data imbal¬ 
ance across sub-national areas is a major concern - some areas have historical 
surveillance sites with many years of observations, and some other area only 
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have newly established surveillance sites with observations in recent years. 
One motivation of our model is to improve the estimation and prediction 
for low-data-quality areas by borrowing information from high-data-quality 
areas. Trying to mimic this situation, for each country, we create one “low- 
data-quality” area by keeping only the middle 6 years data as the training 
set and using both the early years data (however many years available) and 
last 3 years data as the test set in that area, and the other areas in that 
country act as “high-data-quality” areas such that only the last 3 years data 
are withheld as the test set. We also rotate the areas of “low” and “high” 
so each area gets evaluated as the “low-data-quality” area. We summarize 
the MAE’s for both future predictions and past recoveries in Table 3. The 
“Area” column indicates which area is the “low-data-quality” area. The re¬ 
sults show that the mixture model improves 3-year prediction MAE by 0.05 
on average with improvement greater than 0.10 in 8 cases and no less than 
-0.10 in all other cases. The mixture model also improves the early epidemic 
estimation MAE by 0.55 on average with improvement greater than 1.00 in 
8 cases but less than -1.00 in one case. The bottom two panels in Figure 
2 present two cases with large difference between two models in the early 
period of the epidemic, and the mixture model mean trajectory is a better 
approximation to the raw data average than the independent model. 

In addition to the examination of the marginal distribution of prevalence 
within each area, we also examine the joint distributions of the quantity of 
interest across areas, e.g. HIV prevalence and HIV incidence. We apply the 
mixture model to the full datasets of 20 countries. For every country and 
every year, we calculate the prevalence correlation and incidence correlation 
among areas. Figure 3 shows those correlations by years and by countries. 
Those correlations are mostly positive due to the positive correlation of EPP 
input parameters. Since the mixture model involves a hierarchical structure 
for the EPP input parameters which summarize the characteristics of the 
HIV epidemic, the correlations of the EPP output prevalence and incidence 
vary across time. 

4. Discussion. Planning an effective response to the HIV epidemic 
and assessing the impact of the past response requires quantitative anal¬ 
ysis. Sub-national estimates can be used for better program planning and 
management, such as assessing and meeting the needs for commodities, hu¬ 
man resources and other program elements, measuring population coverage 
of treatments, and monitoring and evaluating interventions. In this arti¬ 
cle, we describe a hierarchical epidemiological structure that assumes the 
patterns of the sub-national epidemic are similar, and suggest a mixture 
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between the independent model and newly proposed hierarchical model to 
account for the uncertainty between two model choices. For the estimation 
and projection of HIV epidemics, this will be the first attempt of develop¬ 
ing a joint model for multiple areas within a country. It will be potentially 
implemented in the next round of EPP/Spectrum software (available from 
http://www.futuresinstitute.org). The proposed model can be easily gener¬ 
alized to the study of other epidemics. 

We find that the predictive ability of EPP model can be improved by using 
the mixture model, especially for the areas that are lack of long historical 
data. The mixture model also provides more accurate assessment of the 
early period of the epidemics in many examples, but there are a couple 
of exceptions. If the mixture model and the independent model estimate 
the early trend of the epidemic very differently, we suggest choosing the 
model with the help of other available information such as the year of the 
first case report in that area. The prediction accuracy might be further 
improved by redefining the default prior distribution of independent model, 
such as allowing dependence among (to, J’Oj/^Oj/^i,/d 2 ,/Ssj/34). However, it 
is beyond the focus of this article. 

The computing cost of estimating parameters in a dynamic model is of¬ 
ten high due to the lack of an analytic solution, the multi-modality and 
nonlinearity. We propose an importance sampling method that draws pos¬ 
terior samples of parameters in the mixture model without needing to refit 
the data or unpack the existing software for implementing an independent 
model. We first obtain posterior samples of parameters for each datasets in¬ 
dependently. Since the mixture model and the independent model only differ 
at their prior distributions, we can randomly combine posterior samples of 
the independent model from multiple areas, reweight the joint samples by 
the ratio between the mixture model prior and the default independent prior, 
and draw a set of joint posterior samples from the multinomial distribution 
with weights. The mixture between the original model prior and the new 
model prior provided a lower bound on the ratio, and thus provided satis¬ 
factory sampling efficiency as measured by the number of unique posterior 
samples in our examples. If the number of unique posterior samples is too 
small, we suggest either increasing the number of random combinations, or 
using a less informative prior distribution in the independent model. A more 
efficient sampling strategy should be developed in the future. 

Note that, in this specific application, the incremental mixture importance 
sampling method is used to draw posterior samples for each individual data, 
but our proposed reweighting approach is generic and does not depend on 
how the posterior samples are drawn for individual datasets. For instance. 
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if Markov chain Monte Carlo (MCMC) is used to draw posterior samples 
for individual datasets with reasonable burn-in and thinning, we can as¬ 
sume that those posterior samples have equal weights in the independent 
model. This suggests an alternative way of estimating parameters in de¬ 
pendent data, e.g. data with hierarchical/spatial/temporal structure; fitting 
each piece of data independently, and merge the results with adjustments 
for the difference between the independent model and the joint model. 

When a country has a large geographical territory or diverse regions of 
public health conditions, it is worthy to further develop a spatial model or 
multilevel hierarchical model based on the knowledge of the national offi¬ 
cials. In those cases, the similarity of epidemic patterns may depend on the 
distance or adjacency status between areas, or the regions to which the ar¬ 
eas belong. In countries with low-level and concentrated epidemics, HIV has 
spread rapidly in several high risk groups, but is not well established in the 
general population. Fewer data are available for those high risk populations 
due to the stigmatized nature of these populations in many countries. In 
such cases, a hierarchical model can allow sharing information across areas 
and high risk groups. 

The results presented in this paper are based on illustrative HIV preva¬ 
lence data for these countries, which may not be complete. These results 
should therefore not be seen as replacing or competing with official esti¬ 
mates regularly published by countries and UNAIDS. 
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Table 2 

A summary table for the number of data-years in training dataset, the 3-year prediction 
mean absolute error of antenatal clinic prevalence in %, the number of unique posterior 
samples. Bold highlights the areas with less than 5 years of training data. 


Country 

Area 

Training Data 

Mean Absolute Error (%) 

1 Number of Unique Samples I 

# of Years 

Independent 

Mixture 

Improvement 

Independent 

Mixture 

Benin 

Urban 

18 

1.56 

1.57 

-0.01 

808 

152 

Benin 

Rural 

6 

0.88 

0.88 

-0.01 

824 

186 

Botswana 

Urban 

15 

3.07 

3.07 

0.01 

657 

460 

Botswana 

Rural 

15 

4.31 

4.14 

0.17 

793 

467 

BurkinaFaso 

Urban 

17 

0.71 

0.7 

0.02 

797 

513 

BurkinaFaso 

Rural 

5 

0.47 

0.48 

0.00 

821 

360 

Burundi 

Urban 

17 

6.66 

6.69 

-0.03 

814 

730 

Burundi 

Rural 

17 

1.52 

1.74 

-0.22 

818 

571 

Cameroun 

Urban 

13 

3.88 

3.88 

0.00 

817 

677 

Cameroun 

Rural 

2 

3.3 

3.03 

0.26 

803 

613 

Chad 

Urban 

6 

2.42 

2.37 

0.05 

811 

283 

Chad 

Rural 

2 

1.23 

1.01 

0.22 

809 

244 

Ethiopia 

Urban 

11 

2.5 

2.43 

0.08 

785 

283 

Ethiopia 

Rural 

7 

1.22 

1.22 

-0.01 

808 

374 

Ghana 

Urban 

14 

0.74 

0.72 

0.02 

803 

450 

Ghana 

Rural 

13 

0.71 

0.75 

-0.04 

822 

430 

Kenya 

Urban 

15 

3.11 

3.04 

0.07 

759 

624 

Kenya 

Rural 

12 

2.72 

2.5 

0.22 

819 

617 

Lesotho 

Urban 

8 

5.93 

5.9 

0.03 

815 

755 

Lesotho 

Rural 

8 

4.17 

4.01 

0.16 

817 

772 

Malawi 

Central 

9 

2.55 

2.58 

-0.01 

806 

719 

Malawi 

Northern 

9 

2.04 

2.05 

-0.02 

817 

817 

Malawi 

Southern 

9 

5.35 

5.27 

0.08 

797 

729 

Mali 

Urban 

3 

1.03 

1 

0.03 

803 

665 

Mali 

Rural 

3 

0.99 

0.97 

0.02 

828 

663 

Nigeria 

Urban 

7 

2.58 

2.59 

-0.01 

782 

725 

Nigeria 

Rural 

2 

2.21 

2.22 

-0.01 

821 

645 

RCI 

Urban 

14 

1.97 

1.92 

0.05 

795 

528 

RCI 

Rural 

3 

2.54 

2.52 

0.02 

824 

544 

RDC 

Urban 

20 

1.23 

1.22 

0.02 

770 

670 

RDC 

Rural 

11 

1.33 

1.34 

-0.01 

817 

525 

Rwanda 

Urban 

6 

1.88 

1.8 

0.09 

807 

529 

Rwanda 

Rural 

6 

0.87 

0.87 

0.01 

807 

355 

SierraLeone 

Urban 

3 

2.7 

2.49 

0.20 

794 

591 

SierraLeone 

Rural 

2 

1.78 

1.6 

0.18 

834 

509 

Tanzania 

Urban 

18 

1.61 

1.61 

0.00 

796 

685 

Tanzania 

Rural 

18 

1.37 

1.36 

0.00 

796 

645 

Uganda 

Urban 

20 

2.34 

2.33 

0.01 

767 

763 

Uganda 

Rural 

17 

1.59 

1.59 

0.00 

807 

797 

Zambia 

Urban 

5 

2.55 

2.5 

0.05 

797 

484 

Zambia 

Rural 

5 

3.08 

3.1 

-0.02 

818 

541 

Mean 



2.23 

2.18 

0.05 

801 

537 

5th quantile 



0.71 

0.72 

-0.03 

767 

235 

95th quantile 



4.55 

4.40 

0.22 

828 

762 
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SierraLeone Urban 


SierraLeone Rural 



Burundi Urban 




Burundi Rural 



Uganda Urban 


Uganda Rural 




Year 


Year 


Fig 2 : HIV prevalence estimation and prediction for 6 datasets that present large dif¬ 
ferences between the independent model and the mixture model. Within each hgure, the 
black solid line is the posterior mean trajectory of the original model; the blue solid line is 
the posterior mean trajectory of the mixture model; the dashed black/blue lines show the 
95% credible intervals; colored dots are observed prevalence from different ANC sites; the 
red solid line is the raw average trajectory of ANC prevalence. The red dashed vertical 
lines separate the training data and test data. 
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Table 3 

A summary table for the early data period estimation mean absolute errors and the 
3-year prediction mean absolute errors in %. 


Country 

Area 

Early Data Period MAE 

Prediction MAE I 

Independent 

Mixture 

Improvement 

Independent 

Mixture 

Improvement 

Botswana 

Urban 

9.44 

3.60 

5.84 

3.17 

2.98 

0.18 

Botswana 

Rural 

4.6 

6.16 

-1.57 

4.39 

4.33 

0.06 

Burundi 

Urban 

7.79 

7.84 

-0.05 

16.57 

16.65 

-0.08 

Burundi 

Rural 

3.83 

3.94 

-0.11 

2.15 

2.25 

-0.10 

Ethiopia 

Urban 

3.40 

3.20 

0.20 

2.29 

2.16 

0.14 

Ethiopia 

Rural 

1.44 

1.45 

-0.01 

1.16 

1.16 

0.00 

Ghana 

Urban 

1.89 

1.24 

0.65 

0.84 

0.81 

0.03 

Ghana 

Rural 

1.50 

0.91 

0.59 

0.71 

0.76 

-0.05 

Kenya 

Urban 

5.74 

5.69 

0.05 

1.87 

1.84 

0.03 

Kenya 

Rural 

4.41 

4.31 

0.10 

2.52 

2.54 

-0.02 

Lesotho 

Urban 

7.74 

6.57 

1.17 

1.18 

1.18 

0.01 

Lesotho 

Rural 

10.03 

8.23 

1.80 

2.97 

2.77 

0.20 

Malawi 

Central 

3.82 

4.77 

-0.85 

2.20 

2.25 

-0.05 

Malawi 

Northern 

7.63 

7.18 

0.45 

1.72 

1.71 

0.01 

Malawi 

Southern 

5.67 

3.64 

2.03 

4.65 

4.74 

-0.09 

RDC 

Urban 

2.18 

2.09 

0.09 

1.20 

1.20 

0.00 

RDC 

Rural 

3.27 

2.46 

1.19 

0.85 

0.85 

0.00 

Rwanda 

Urban 

9.88 

10.14 

-0.26 

1.91 

1.70 

0.21 

Rwanda 

Rural 

1.51 

1.84 

-0.33 

1.13 

0.93 

0.20 

Tanzania 

Urban 

4.85 

4.95 

-0.10 

1.55 

1.54 

0.00 

Tanzania 

Rural 

7.88 

6.53 

1.35 

1.61 

1.35 

0.27 

Uganda 

Urban 

8.47 

6.04 

2.42 

2.60 

2.45 

0.15 

Uganda 

Rural 

4.04 

2.56 

1.47 

2.18 

2.07 

0.11 

Zambia 

Urban 

7.12 

4.78 

2.34 

2.34 

2.41 

-0.07 

Zambia 

Rural 

2.99 

2.74 

0.25 

2.75 

2.85 

-0.10 

Mean 


5.01 

4.46 

0.55 

2.71 

2.66 

0.05 

5th quantile 


1.45 

1.27 

-1.43 

0.84 

0.82 

-0.09 

1 95th quantile 

9.81 

8.17 

2.41 

4.33 

4.27 

0.21 
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Prevalence Correlation 


Incidence Correlation 




Fig 3 : The prevalence correlations (left) and incidece correlations (right) between sub- 
naitonal areas produced by the mixture model. Each gray line corresponds to one country. 
The black solid line and dashed lines provide the median and (5th, 95th) quantiles. The 
red dashed line indicates a zero correlation which corresponds to the expected correlation 
in the independent model. 









